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Abstract 

Background: Insulin producing beta cell and glucagon producing alpha cells are colocalized in pancreatic islets in 
an arrangement that facilitates the coordinated release of the two principal hormones that regulate glucose 
homeostasis and prevent both hypoglycemia and diabetes. However, this intricate organization has also 
complicated the determination of the cellular source(s) of the expression of genes that are detected in the islet. 
This reflects a significant gap in our understanding of mouse islet physiology, which reduces the effectiveness by 
which mice model human islet disease. 

Results: To overcome this challenge, we generated a bitransgenic reporter mouse that faithfully labels all beta and 
alpha cells in mouse islets to enable FACS-based purification and the generation of comprehensive transcriptomes 
of both populations. This facilitates systematic comparison across thousands of genes between the two major 
endocrine cell types of the islets of Langerhans whose principal hormones are of cardinal importance for glucose 
homeostasis. Our data leveraged against similar data for human beta cells reveal a core common beta cell 
transcriptome of 9900+ genes. Against the backdrop of overall similar beta cell transcriptomes, we describe marked 
differences in the repertoire of receptors and long non-coding RNAs between mouse and human beta cells. 

Conclusions: The comprehensive mouse alpha and beta cell transcriptomes complemented by the comparison of 
the global (dis)similarities between mouse and human beta cells represent invaluable resources to boost the 
accuracy by which rodent models offer guidance in finding cures for human diabetes. 
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Background 

Pancreatic beta and alpha cells are clustered together in 
pancreatic islets to ensure tight coordination of the se- 
cretion of insulin and glucagon, whose opposing actions 
on hepatic glucose metabolism are essential for glucose 
homeostasis. Yet, despite the functional juxtaposition of 
insulin and glucagon, pancreatic beta and alpha cells de- 
rive from a common progenitor population that during 
embryonic development is first uniquely defined by the 
expression of the basic helix-loop-helix transcription fac- 
tor neurogenin 3 (NeurogS) [1]. Under the influence of 
lineage-specific sets of transcription factors, Neurog3+ 
early endocrine progenitors undergo stepwise differen- 
tiation along parallel lineages to develop into mature 
alpha and beta cells [2-5]. It follows that upon comple- 
tion of their differentiation and maturation, alpha and 
beta cells, by virtue of the secretion of their signature 
hormones glucagon and insulin, have adapted to func- 
tionally opposing roles in glucose metabolism. Insulin 
acts on most peripheral tissues to facilitate the clearance 
of glucose from the general circulation by inhibiting 
hepatic gluconeogenesis and promoting hepatic glycogen 
synthesis and glucose uptake in skeletal muscle and adi- 
pocytes [6,7]. Furthermore, insulin conveys important 
metabolic feedback to hypothalamic feeding centers 
regarding the energy state of the body [8]. In contrast, 
the hepatic actions of glucagon, which include the 
stimulation of glycogenolysis, provide an essential coun- 
ter-regulatory mechanism to ensure stable glycemic 
control [9]. The serious consequences of the absolute 
insulin deficiency that occur secondary to autoimmune 
attack of the beta cells in type 1 diabetes as well as the 
relative insulin deficiency characteristic of type 2 dia- 
betes illustrate the critical importance of insulin. How- 
ever, the view that hyperglucagonaemia contributes 
significantly to the progression of both major forms of 
diabetes has gained acceptance [10]. It follows that a 
comprehensive understanding of both the alpha and the 
beta cell and how they regulate glucose homeostasis 
through the coordination of their activity quite possibly 
will hold the key to therapeutic interventions aimed at 
curbing the current diabetes epidemic. 

The intimate clustering of beta and alpha cells in 
pancreatic islets subserves the tight coordination of the 
secretion of insulin and glucagon, but has long compli- 
cated the process of obtaining highly pure populations 
of alpha and beta cells from isolated islets. To overcome 
this challenge, we have developed a reporter strain that 
fluorescently marks all beta cells by the nuclear expres- 
sion of mCherry under control of the mouse insulin 1 
(Insl) promoter and crossed it to a GFP reporter line 
that labels alpha cells. Bitransgenic offspring of a cross 
between both lines then enabled the collection of purified 
populations of alpha and beta cells from the same islets. 



These tools enabled the first transcriptome-wide com- 
parison of murine alpha and beta cells by RNA-seq, which 
revealed over 2500 genes that are differentially expressed 
(p-value < 1 X 10 '', false discovery rate (FDR) < 0.1%) bet- 
ween both populations, with insulin the second-most 
significantly enriched in the beta cell fraction. Next gene- 
ration sequencing combines the tremendous benefits in 
sensitivity and dynamic range of techniques traditionally 
used to assess expression of targeted sets of genes, such as 
quantitative PGR, with the scale of microarray and the dis- 
tinct benefit that no a priori sequence information is re- 
quired, which enables novel transcript discovery. We talce 
advantage of our data to conduct the first transcriptome- 
wide assessment of the similarities and differences that 
exist between rodent and human beta cells. These data 
provide an unprecedented global view of the core beta cell 
transcriptome that is conserved across species, while sim- 
ultaneously highlighting marked differences in the expres- 
sion of receptors and associated long non-coding RNAs 
(IncRNAs) between beta cells of both species. An un- 
biased and comprehensive comparison between mouse 
and human beta cells is an invaluable resource that can fa- 
cilitate the translation of preclinical findings in rodent 
models towards therapeutic strategies aimed at alleviating 
or curing diabetes. 

Results 

Generation and validation of mlns1-l-l2b-mCherry and 
SIOOb-eGFP reporter lines 

While available beta cell reporter lines such as the MIP- 
GFP and MIP-RFP lines [11], facilitate the purification 
of pure populations of beta cells, reporter expression in 
these lines is mosaic, which leaves as much as half or 
more of all beta cells not marked [12]. To obtain a beta 
cell reporter line that faithfully and selectively labels all 
beta cells, we generated a transgenic reporter line by 
cloning an open reading frame encoding a fusion protein 
between human histone-2b (H2b) and monomeric 
Cherry (mCherry) downstream of a 8.3 kB mouse insulin 
1 promoter (generously gifted by Dr. Mark Magnuson) 
(Figure lA). The resulting mInsl-H2b-mGherry trans- 
genic reporter mice demonstrate nuclear mGherry ex- 
pression in all beta cells (Figure IB) and are healthy, 
viable and normoglycemic with no sign of glucose in- 
tolerance (Figure IG, D). To co-label alpha cells, we 
capitalized on the serendipitous discovery that enhanced 
green fluorescent protein (eGFP) expression in the 
SlOOb-eGFP transgenic reporter line [13] labels alpha 
cells within the islet (Additional file 1). As there is no 
detectable expression of endogenous SlOOb transcript in 
purified SlOOb + alpha cells, the expression of eGFP in 
alpha cells is an artifactual, but useful trait that enables 
the purification of alpha cells by FAGS. 
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Figure 1 Generation of a beta cell reporter mouse that faithfully and selectively marks all beta cells. A fusion of histone-2b (H2b) and 
monomeric cherry (mCherry) was inserted downstream of the long Insl promoter fragment (A) to generate a mlns1-H2b-mCherry beta cell 
reporter mouse that features nuclear expression of mCherry in all beta cells (B). Male (C) and female (D) mlns1-H2b-mCherry mice demonstrate 
normal glucose control as demonstrated by glucose tolerance test compared to wild type littermates. 



Whole transcriptome analysis of highly enriched mouse 
beta and alpha cells 

Islets isolated from two replicate groups of bitransgenic 
offspring of a cross between the mInsl-H2b-mCherry 
and SlOOb-eGFP reporter lines (Figure 2A) enable the 
simultaneous purification of alpha and beta cells by 
FACS (Figure 2B). We detected not a single read in our 
eGFP + alpha cell fractions that maps to mCherry, and 
only three reads that map to eGFP in our mCherry + 
beta cell fractions, underscoring the quality of our FACS 
purification strategy (Figure 2C, D). Furthermore, while 
the expression of the endogenous Insl gene measures in 
at an average RPKM (reads per kilobase of exon model 
per million mapped reads) [14] value of approximately 
230,000 (Additional file 2), the use of the 10 kb mouse 
Insl promoter to drive H2b-mCherry transcription in 
the beta cells results in mCherry RPKM values of only 
slightly over 5. This relatively low mRNA expression 
despite the use of one of the strongest promoters in the 
beta cell context partially explains the relatively dim 
mCherry signal in the nuclei of beta cells and may have 
fortuitously contributed to the fact that our mInsl-H2b- 
mCherry beta cell reporter mice are healthy and do not 
display or develop any beta cell defects that would pre- 
cipitate diabetes (Figure IC, D). 

A comprehensive comparison of the transcriptomes 
of alpha and beta cells revealed 2547 genes that were 



differentially expressed between beta and alpha cells. A 
total of 1075 genes were significantly (p-value < 1 x 10"^, 
FDR < 0.1%) enriched in beta cells. The Insulin 2 {Ins2) 
and Iml genes are the second- and third most sig- 
nificantly enriched genes in beta cells compared to alpha 
cells, after the beta cell-specific transcription factor 
Mafa (Figure 2E; Additional file 2), which serves as a po- 
tent confirmation of the high purity of the beta cell 
population obtained by the use of our mInsl-H2b- 
mCherry reporter mouse. Other genes that were highly 
significantly enriched in mouse beta cells are genes 
encoding the prolactin receptor [Prlr) and the peptide 
hormone UrocortinS {UcnS) (Figure 2E). Conversely, 1472 
genes were sig nificandy (p-value < 1 x 10 , FDR < 0.1%) 
enriched in the alpha cell fraction, including the mouse 
alpha cell-specific transcription factors Arx, Irxl, Mafb, as 
well as glucagon {Gcg) (Additional file 3). It is no surprise 
that Ins2 and Insl are the most abundantly detected tran- 
scripts in beta cells, comprising on average 13.0% and 
5.1%, respectively, of all mapped reads in the beta cell 
libraries (Figure 2F). In the alpha cell-enriched libraries, 
Gcg is responsible for on average 15.7% of all mapped 
reads. Expression of other islet hormones, including pan- 
creatic polypeptide {Ppy), somatostatin {Sst), peptide YY 
{Pyy) and islet amyloid polypeptide (lapp), is between 6.5 
and 33-fold lower than the expression of Gcg, which con- 
firms the successful enrichment of alpha cells (Figure 2G). 
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Figure 2 Validation of comprehensive transcriptomes of mouse beta and alpha cells. Bitransgenic offspring of mlns1-H2b-mCherry x 
SIOOb-eGFP bitransgenic reporter mice (A) enable the FACS purification of pure populations of beta and alpha cells (B). Expression of eGFP is 
negligible in mCherry + beta cells (C), and mCherry expression in eGFP + alpha cells is not detected (D). A volcano plot highlights key alpha and 
beta cell-enrlched genes (E). The most abundantly expressed genes in beta (F) and alpha (G) cells are expressed by the fraction of total reads that 
maps to each gene and compared to expression in the opposite cell type. Expression of the beta cell markers Insl (H), Ins2 (I), Ucn3 (J) and Mafa 
(K) in alpha cells measures on average less than 2.39% of expression in beta cells. Conversely, expression of the alpha cell markers Gcg (L), Arx 
(M), Irxl (N) and Mafb (O) in mouse beta cells is on average even lower. See also Additional files 1, 2 and 3. 
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While the abundance of Ins2IInsl and Gcg transcripts is 
expected in beta- and alpha cell-enriched libraries, re- 
spectively, the fraction of reads that maps to insulin is 
lower than for human beta cells, where upwards of 38% of 
all mapped reads in FACS -purified beta cells were as- 
signed to INS [15,16]. As a further validation of our FACS 
purification strategy, expression of known mouse beta 
cell-specific genes in the alpha cell fraction, including 
Insl, Ins2, Ucn3 [17,18] and Mafa [19], was on average 
less than 2.39% of the expression detected in beta cells 
(Figure 2H-K). Conversely, transcripts encoding the alpha 
cell-transcription factors Arxl, Irxl and Mafb and Gcg, 
which are all highly significantly enriched in alpha cells 
(Additional file 3), are detectable in the beta cell fraction 
at on average less than 1.49% of their expression in alpha 
cells (Figure 2L-0). We compared our alpha and beta cell 
transcrip tomes to a previous study that contrasted FACS- 
purified MIP-GPF + beta cells to transcriptomes of intact 
islets [20] . Their effort was somewhat limited by the afore- 
mentioned mosaicism of the MIP-GFP reporter line [11], 
which reduced the contrast between the transcriptomes of 
MIP-GFP + beta cells and total islets [20]. This manifested 



as the unremarkable enrichment of key beta cell markers 
including Ins2, Insl, Mafa and others in this study com- 
pared to our work (Additional file 4). 

Functional validation of alpha and beta cell 
transcriptomes 

While global pathway analysis of beta cell-enriched genes 
by DAVID [21] reveals significant association with key- 
words linked to aspects of beta cell function and disease 
(Additional file 5), generic functional categories can lack 
the resolution to discriminate between relatively similar 
cell types such as alpha and beta cells. We therefore 
assessed the expression of the transcription factors that 
determine alpha and beta cell identity, often in a mutually 
repressive fashion (Figure 3 A) [4,22]. The abundant ex- 
pression of the transcription factors Pdxl, Nkx6.1, Mafa 
and Mnxl in mature beta cells (Figure 3B) is in close 
agreement with several reports that demonstrate their 
expression to be necessary for mature beta cell identity 
and function [4,23-25]. Mafb, Arx, Irxl and Irx2 are sig- 
nificantly enriched in mouse alpha cells. Interestingly, 
many transcription factors that are required for proper 
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Figure 3 Comparison of mouse beta and alpha cell transcriptomes based on functional annotation. Establishment and maintenance of 
alpha and beta cell identity is regulated by a complex interplay of transcription factors (A), whose expression pattern is accurately reflected by 
our mouse alpha and beta cell transcriptomes. Actual transcription factor expression in beta and alpha cells is represented by a dot plot where 
each dot represents an individual gene (B). Genes that are significantly enriched (p < 1 x 10"') in beta or alpha cells are highlighted in red and 
green, respectively, with genes enriched with p< 1 x 10~™ displayed in bold for emphasis. Insl, Ins2, and Gcg are marked for reference. See also 
Additional files 5 and 6. 
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islet development and function, such as Pax6, Neurodl, 
Ml, Rfx6, Nkx2.2 , Mytl, Foxol, and Foxa2 are robustly 
expressed in both alpha and beta cell fractions and are not 
significantly, or only modestly enriched in either popu- 
lation. NeurogS is required for the formation of the 
endocrine pancreas [1] and remains detectable at relatively 
low levels (Figure 3B), but is depleted from beta cells 
(Figure 3B) in contrast to reports that attribute NeurogS 
expression in adult islets to mature beta cells [26-28]. 
Pax4 is important to repress the alpha cell-determining 
transcription factor Arx and facilitate differentiation along 
the beta and delta lineage [22,29], but is undetectable in 
mature mouse beta cells (Figure 3B) [30]. We similarly 
compared the relative enrichment of key genes involved in 
glucose sensing, stimulus secretion coupling and insulin 
exocytosis between alpha and beta cells and found that 
the facilitated glucose transporter Slc2a2 (a.k.a. Glut2), 
glucose-6-phosphate convertase 2 {G6pc2), proprotein 
convertase subtUisin/kexin type 1 (Pcskl), Ero 1-like beta 
(Erollb), and synaptotagmin-like 4 {Sytl4, a.k.a. granuphilin) 



were highly significantly enriched in beta cells, while genes 
encoding the ATP-sensitive potassium channel and volt- 
age-gated calcium channels were equally abundantly ex- 
pressed in alpha and beta cells (Additional fOe 6). 

Differences between mouse and human islet cells 

Mice are widely used as models for islet disease, despite 
marked differences between rodent and primate islet 
architecture and innervation [31-34]. The full transcrip- 
tomes of mouse beta and alpha cells enable a comprehen- 
sive comparison of both the similarities and differences in 
gene expression between mouse and human alpha and 
beta cells. To validate the approach, we compared the ex- 
pression of genes with known species differences between 
mouse and human alpha and beta cells. Mafa and Mafb 
are selectively expressed in adult mouse alpha and beta 
cells, respectively (Figure 4A) [19,35]. In contrast, while 
Mafa expression is similarly restricted to human beta 
cells, Mafb expression is retained in human beta cells and 
therefore not enriched in either human islet population 
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Figure 4 Validation of known differences in gene expression between mouse and human alpha and beta cells. We assessed the 
expression of two sets of genes, Mafa/Mafb and Ucn3/Crh, that markedly differ in distribution between alpha and beta cells in mouse (A) and 
human (B) islets. Mafa is restricted to beta cells of both species, while Mafb is selectively expressed in mouse alpha cells (A), but expressed in 
human beta cells as well (B). Ucn3 is highly selective for mouse beta cells while Crh is detected in neither mouse islet population (A, C, D). By 
contrast, LICN3 expression is a common feature of human alpha and beta cells, while CRH is enriched in human alpha cells (B, E, F). Human 
RNA-seq data are from [15]. See also Additional file 7 for single channel images. 
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(Figure 4B) [36,37]. As a second example of species dif- 
ferences between mouse and human islets, Ucn3 is highly 
selectively expressed in mouse beta cells [17,18] (Figure 2E; 
Figure 4A, C; Additional file 7) but is a common feature 
of human alpha and beta cells [17]. This is reflected by 
robust expression of human UCN3 but lack of enrichment 
in either human islet cell type (Figure 4B, E; Additional 
file 7). The gene encoding the related peptide hormone 
corticotropin-releasing hormone (Crh), best known as the 
principal hypothalamic initiator of the stress response, is 
not detectable in mouse islets (Figure 4A, D) but is highly 
enriched in human alpha cells (Figure 4B, F; Additional 
file 7), likely explaining the endogenous insulinotropic 
activity in human islets that is blocked by the selective 
CRHi receptor blocker antalarmin [12]. 

Comprehensive comparison of mouse and human beta 
cell transcriptomes 

Having thus validated our approach, we proceeded with 
a global comparison of the genes enriched in mouse and 
human beta cells using a comparable set of transcrip- 
tomes for human FACS -purified beta cells [16]. One has 
to be very careful when conducting cross-species com- 
parisons of next-generation sequencing libraries that are 
generated by different investigators using different pro- 
tocols. To minimize the potential caveats associated with 
such a comparison, we built in several safeguards in our 
analysis pipeline. We reanalyzed the raw human se- 
quence files in the same manner as our mouse data and 
used transcriptome data from beta cells of both species 
expressed as RPKM to allow direct, unbiased compa- 
rison. This yielded a total of 9905 genes that demon- 
strated RPKM expression values > 1 in both mouse and 
human beta cells and thus constitute the common beta 
cell transcriptome (Additional file 8). This extensive list 
includes many of the key genes that define beta cell 
identity such as Ins2IINS, Ucn3, Mafa, Nkx6.1, Pdxl, 
Erollb, Glplr, G6pc2, and the ATP-sensitive potasssium 
channel subunits Abcc8 and Kcnjll (Figure 5A). To 
identify the most striking species differences between 
mouse and human beta cells, we focused only on the 
genes that demonstrated both significant enrichment 
(p-value < 1 X 10"^, FDR < 0.1%) and over 10-fold dif- 
ference in beta cells of one species over the other. This 
approach identified 725 genes that were either uniquely 
expressed (human beta RPKM < 1, n = 569) or significantly 
enriched (human beta RPKM > 1, n = 156) in mouse beta 
cells (Figure 5 A, B; Additional file 9). Conversely, we iden- 
tified 815 genes as either uniquely expressed (mouse beta 
RPKM < 1, n = 666) or significantly enriched (mouse beta 
RPKM > 1, n = 149) in human beta cells (Figure 5 A, B; 
Additional file 9). Significant enrichment in gene expres- 
sion in one species correlated with the presence of Pdxl 
ChlP-Seq peaks specific to that species (Figure 5C) [38], 



which in turn correlated strongly with the presence of 
species-specific PDXl binding motifs (Figure 5D). This 
suggests that differences in the mouse and human PDXl 
cistrome contribute to the differential expression of genes 
between mouse and human beta cells, as is illustrated by 
the Sytl4 gene, which is uniquely expressed in mouse beta 
cells and is associated with a strong mouse-specific PDXl 
ChlP-Seq peak that overlies a consensus Pdxl binding 
motif in the mouse that is absent from human (Figure 5E). 

The small subset of 156 genes that is significantly 
(p < 1 X 10"^, FDR < 0.1%) and robustly (> 10-fold) enriched 
in mouse beta cells, but still detectable in human beta cells, 
constitutes less than 1.5% of the common beta cell tran- 
scriptome (Figure 5A). This cluster includes lapp, which is 
more than 30-fold reduced in mouse over human beta cells 
(Figure 5B, F), perhaps in adaptation to the amyloid pro- 
perties of human but not mouse lAPP that contribute to 
the onset and progression of type 2 diabetes [39]. The 
strongly disparate relative expression of the gene encoding 
lAPP in mouse and human islets is in contrast to other 
hormones such as INSIIns2 and UcnS, which are expressed 
at similar levels between mouse and human beta cells 
(Figure 5A, B). We further validated the relatively low ex- 
pression of lapp message by demonstrating that lAPP pep- 
tide content is reduced almost 28-fold in human islets 
compared to mouse, while INS and GCG content is similar 
in islets from both species (Figure 5G; Additional fde 10). 
A similarly small set of 149 genes that is detectably 
expressed in beta cells of both species, but robustly and 
significantly enriched in human beta cells includes Gad2 
(Figure 5B, H), which is a major auto-antigen in human 
type 1 diabetes [40,41]. 

Notable differences in cell surface receptors between 
human and mouse beta cells 

Several of the most notable gene expression differences 
between mouse and human beta cells concern cell sur- 
face receptors of the class I helical cytokine receptor 
family [42]. Prlr has long been implicated in beta cell 
mass expansion in adaptation to the increased metabolic 
demand of pregnancy in rodents in response to PRL or 
placental lactogens [43,44]. Indeed, Prlr is abundantly 
and selectively expressed in mouse beta cells (Figures 2E, 
5 A, B; Additional file 2). In contrast, while Prlr is detect- 
able in FACS -purified human beta cells and is associated 
with a human PDXl ChlP-Seq signature (Figure 51), it is 
expressed at approximately 15-fold lower RPKM values 
than mouse beta cells (Figure 5B, Additional file 9) and 
is significantly depleted compared to co-purified human 
non-beta cells (not shown). Similarly, related receptor 
genes for growth hormone (Ghr) (Figure 5J) and ciliary 
neurotrophic factor (Cntfr) (Figure 5K) are robustly 
expressed by mouse beta cells, but are virtually unde- 
tectable in human beta cells. 
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(See figure on previous page.) 

Figure 5 Comparison of the mouse and human beta cell transcriptomes. The unbiased comparison of mouse and liuman transcriptomes 
reveals a common core of over 9900 genes tliat are expressed at an RPKM value > 1 in beta cells of both species (A). A total of 725 genes (569 
unique and 156 enriched) was robustly (>10-fold) and significantly (p < 1 x 10"') higher expressed in mouse beta cells (B, red) while 815 genes 
(566 unique and 149 enriched) were by the same criteria significantly higher expressed in human beta cells (B, blue). Genes enriched in mouse 
beta cells were co-enriched for mouse-specific PDXl binding and vice versa (C). Moreover, enrichment of PDXl ChlP-Seq peaks in either species is 
associated with species-specific PDXl binding morifs (D), as illustrated for Syf/4 (E). lapp transcript levels in purified beta cells (F) and islet lAPP 
peptide content (G) are markedly reduced in human compared to mouse. Additional browser plots demonstrate notable examples of significantly 
enriched or selectively expressed genes in either species including Gad2 (H), Prir (I), Glir (J) and Cntfr (K). Human RNA-seq data are from [16], 
ChlP-Seq data are from [38,49]. See also Additional files 8, 9 and 10. 



Novel transcript discovery 

We leveraged our combined alpha and beta RNA-seq 
expression data for novel islet transcript discovery in the 
mouse islet. Our initial list of novel transcripts included 
73% of the IncRNAs identified in purified mouse beta 
cells [20] and 54% of the IncRNAs identified from total 
mouse islets [15]. We formulated a stringent set of 
criteria (see Methods) to generate a set of 145 novel 
IncRNA transcripts. Of these, 18 are positioned within 
the introns of known protein-coding genes, while the 
remaining 127 are situated in intergenic regions, the ma- 
jority of which were not identified in previous studies of 
mouse islet IncRNAs [15,20] (Additional file 11). The 
transcript ends for these novel islet transcripts were sig- 
nificantly enriched for transcription factor binding by 
PDXl (Figure 6A), NKX6.1 (Figure 6B), MAFA and 
NEURODl (not shown) compared to random positions. 
They also were enriched for histone H3 lysine 4 (H3K4) 
mono-methyl and tri-methyl signatures that mark tran- 
scriptionally active loci (Figure 6C) [45]. These observa- 
tions underscore that these novel IncRNAs are 
transcriptionally active and suggest transcriptional regu- 
lation by beta cell specific transcription factors. As 
IncRNAs are emerging as an additional layer in the regu- 
lation of gene expression of neighboring protein-coding 
genes, we compared the relative enrichment of IncRNAs 
in beta cells with the enrichment of their nearest 
protein-coding gene. This revealed that the ratio of ex- 
pression in mouse beta and alpha cells is correlated 
between these novel transcripts and their nearest protein- 
coding gene (Figure 6D). This general phenomenon has 
also been noted for human islet-specific IncRNAs [15] and 
is illustrated by novel IncRNAs associated with Illrl 
(Figure 7), Nkx6.1 (Figure 6E) and Pparg (Figure 6F), 
which are all co-enriched in mouse beta over alpha cells 
in concert with their closest protein-coding gene. To test 
whether these novel beta cell IncRNAs are subject to tran- 
scriptional regulation, we stimulated primary mouse islets 
overnight with 16.8 mM glucose and compared their ex- 
pression to non-stimulated control islets from the same 
pool. Several of our set of 145 IncRNAs were among the 
most significantly up regulated (Figure 6H-J) or down 
regulated (Figure 6K) by glucose of all coding and non- 
coding transcripts. 



The lnterleukln-1 receptor locus in beta cells 

The interleukin-1 receptor type 1 (//iri/CD121a) locus 
aptly recapitulates all aspects of our study. Illrl is the most 
abundantly expressed cell surface receptor on mouse beta 
cells by a considerable margin with an RPKM value > 200. 
Expression of Illrl is significantly enriched in mouse beta 
over alpha cells (p < 1.9 x 10 '^^) and the mouse Illrl pro- 
moter is associated with strong transcription factor binding 
by the beta cell transcription factors PDXl, NKX6.1 and 
NEURODl (Figure 7A), validating the beta cell as the 
source of Illrl expression. In sharp contrast, ILIRI expres- 
sion in human islet populations, while detectable, is signifi- 
cantly lower in human beta cells in comparison to both 
mouse beta cells (Figure 5A, B) and human alpha cells 
(Figure 7A, B). The high levels of Illrl message in mouse 
beta cells translate into abundant cell surface expression of 
ILlRl/CD121a on mCherry + beta cells as measured by 
FACS (Figure 6C) and confocal microscopy (Figure 7D, E). 
An additional notable feature unique to the mouse Illrl 
locus is a IncRNA positioned about 30 kb upstream of the 
Illrl transcription start site. This IncRNA demonstrates 
strong beta cell enrichment similar to the adjacent Illrl 
transcript and is associated with significant transcription 
factor binding by PDXl, NKX6.1, MAFA and NEURODl 
(Figure 7A). The corresponding region of the human 
ILIRI locus shows no human PDXl binding or evidence 
for the expression of a similar IncRNA in islet cells 
(Figure 7B). Moreover, expression of this mouse-specific 
IncRNA upstream of Illrl is the most significandy up 
regulated of all transcripts, coding and non-coding, follo- 
wing overnight stimulation of mouse islets with glucose 
(Figure 7F; Additional file 11). A repeat of 13 consensus 
binding sites for the transcription factor V-Ets Avian 
Erythroblastosis Virus E26 Oncogene Homolog 1 (ETSl) 
(Figure 6F) that is absent from the human ILIRI locus 
may contribute to both the expression of this IncRNA by 
mouse but not human beta cells and the robust glucose- 
induced up regulation of its expression, as Etsl expression 
is significantly up regulated by glucose (Figure 7G). 

Discussion 

To ensure tight coordination of the secretion of insulin 
and glucagon, pancreatic beta and alpha cells are intimately 
colocalized in the pancreas, which has long complicated 
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the process of obtaining highly pure populations of alpha 
and beta cells from isolated islets. We successfully over- 
came this problem by generating a mInsl-H2b-mCherry 
beta cell reporter line that enabled us to achieve over 98% 
beta cell purity, which is markedly higher than previous 
studies that used antibody-based strategies to purify beta 
cells from dissociated human islet suspensions [15,16,46]. 
The fact that both mouse insulin genes rank as the second 
and third most significantly enriched genes in mouse beta 
over alpha cells - trailing only the key beta cell-specific 
transcription factor Mafa - underscores our high quality of 
beta cell enrichment. Further confirmation that our tran- 
scriptomes faithfully recapitulate known expression pat- 
terns in alpha and beta cells follows from the fact that 
expression of key alpha and beta cell transcription factors 
as well key genes involved in glucose sensing, stimulus 



secretion coupling and insulin exocytosis generally adheres 
closely to the literature [2-5]. It is noteworthy that many 
additional genes involved in glycolysis, membrane depo- 
larization, calcium entry and insulin exocytosis are not, or 
only modestly enriched in mouse beta over alpha cells, 
suggestive of similarities in the triggering of hormone re- 
lease by either cell type as was recently reported [47]. 

Importantly, our transcriptome data now offer the 
opportunity for the systematic comparison of gene ex- 
pression by mouse and human beta cells. We identified 
a core set of 9906 genes that are shared between mouse 
and human beta cells. It is no surprise that this set con- 
tains many of the key genes required for the stimulus- 
appropriate release of insulin such as G6pc2, Erollb, 
Pcskl and Glplr. We also uncovered significant tran- 
scriptome differences between mouse and human beta 
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cells that are accompanied with species-specific en- 
richment of beta cell-specific transcription factors, and 
sometimes associated with species-specific IncRNAs. We 
minimized the potential caveats that can stem from dif- 
ferences in islet isolation, dissociation, FACS sorting and 
library preparation between laboratories by focusing only 



on robustly (10-fold enrichment) and significantly diffe- 
rentially expressed (p < 1 x 10 FDR < 0.1) genes. Further 
validation of transcriptome differences between mouse 
and human beta cell transcriptomes was obtained by de- 
monstrating that species-specific enrichment of gene ex- 
pression correlates well with species-specific enrichment 
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of Chip signatures for the beta cell specific transcription 
factors Pdxl [38] and Nkx6.1 [48,49]. Among the more 
strilcing examples are Prlr, Ghr and Cntfr, three related re- 
ceptors that respond to structurally similar ligands [42], 
that are abundandy and selectively expressed by mouse 
beta cells, with only nominal expression in human islets. 
Co-stimulation of CNTF and EGF promotes acinar to beta 
cell transdifferentiation in diabetic mice [50] and GH and 
PRL are potent inducers of mouse beta cell proliferation 
[51] and implicated in the expansion of maternal beta cell 
mass during pregnancy in mice [43,44]. However, efforts 
to use PRL or GH to induce human beta cell proliferation 
in vitro have been largely unsuccessful [52,53] and expan- 
sion of human beta cell mass during pregnancy is modest 
compared to mouse and driven by neogenesis instead of 
beta cell proliferation [54]. While there are clearly diffe- 
rences between mouse and human beta cells that prevent 
the latter from entering the cell cycle [55], the lack of 
GHR and relative absence of PRLR are not conducive to 
robust responses to their potential mitogenic ligands. 

The most abundantly expressed cell surface receptor in 
mouse beta cells, measured by gene expression, is Illrl, 
which responds to the pro-inflammatory cytokines IL-la 
and IL-ip. Importantly, the human islet is not devoid of 
ILIRI transcript, in line with a body of literature that 
established that mouse and human beta cells respond to 
sustained IL-ip stimulation with a reduction in function 
and an increase in apoptosis [56-58]. The robusdy ele- 
vated Illrl expression in mouse islets could betray a more 
prominent role for locally produced IL-ip in beta cell glu- 
cotoxicity in mouse islets and potentially render mouse 
beta cells more sensitive to IL-ip-mediated pro-inflamma- 
tory insults. This observation is important, as local inflam- 
mation in the islet precipitated by obese conditions is 
thought to contribute to beta cell failure and exacerbate 
diabetes [59]. 

Several of the genes that displayed markedly higher ex- 
pression in mouse over human beta cells, such as Illrl 
and Pparg, were flanked by mouse-specific novel IncRNAs. 
We therefore leveraged our transcriptome data to identify 
145 novel IncRNAs that are enriched for significant tran- 
scription factor binding of key beta cell transcription fac- 
tors and are co-regulated with the nearest protein-coding 
gene in line with previous observations [15,60,61]. As 
IncRNAs are not bound by the constraints of protein- 
coding genes, they tend to display less sequence conserva- 
tion and stability over longer phylogenetic distances [62]. 
Indeed, comparison of the set of 145 mouse IncRNAs with 
human [15] revealed evidence for a human orthologous 
IncRNA for only two. It appears that the repertoire of islet 
IncRNAs is dynamic and has changed considerably since 
the divergence of primates and rodents, in line with the 
general notion that the IncRNA repertoire across verte- 
brates is dynamic, even between closely related species 



[60,63,64]. Their generally high plasticity and poor se- 
quence conservation notwithstanding, IncRNAs are emer- 
ging as an important and novel transcriptional regulatory 
mechanism that is likely to significantly impact beta cell 
fate and function. Indeed, we observed that in response to 
stimulation with glucose, the quintessential beta cell trig- 
ger, quite a few IncRNAs were up- or down regulated by 
10-fold or more. The arduous task of attributing function 
to the beta cell-enriched IncRNAs that have emerged from 
ours and similar studies [15,20] is now underway. 

Conclusion 

Rodents are commonly used to study the islets of 
Langerhans, with the ultimate goal of improving the out- 
look of diabetic patients and despite considerable dif- 
ferences in islet architecture and innervation between 
rodent and primate islets. Our comprehensive beta and 
alpha cell transcriptomes will greatly enhance our under- 
standing of normal islet physiology and yield significant 
new leads to direct the behavior of its two primary endo- 
crine constituents. Importantly, our data leveraged against 
recently published transcriptomes for FACS-purified hu- 
man islet populations will now facilitate routine cross 
comparison between mouse and human alpha and beta 
cell transcriptomes. This is an invaluable resource for all 
with an interest in islet physiology and disease that stands 
to improve the translatability of rodent studies [65] by 
ensuring that potential therapeutic targets identified by 
preclinical experiments on rodents are similarly expressed 
by human beta cells. 

Methods 

Generation of a mlns1-H2b-mCherry beta cell reporter 
line 

Animals were maintained on a 12-h light/12-h dark cycle 
with free access to water and standard rodent how. DNA 
encoding a fusion between human histone-2b and mono- 
meric Cherry (H2b-mCherry) was ligated in the Spe I site 
of the mouse insulin 1 promoter (pBS.MIPl(-Spel); gene- 
rously gifted by Dr. Mark Magnuson). The resulting 
mInsl-H2b-mCherry reporter construct (Figure lA) was 
used for pronuclear injection by the Salk transgenic core, 
leading to two founder lines. One founder line demon- 
strated bright nuclear expression of mCherry in only a 
small minority of beta cells and was discarded. The other 
founder line was determined to have the transgenic cas- 
sette inserted at two separate loci, one of which was silent 
and was bred out. No fertility or viability issues were 
noted and reporter expression was only observed in the 
pancreas. Although no effect of transgene copy number 
was observed, only hemizygous offspring were used. The 
mInsl-H2b-mCherry reporter line was crossed to SlOOb- 
eGFP mice (The Jackson Laboratory, Bar Harbor, ME; 
strain 005621) [13]. Glucose tolerance tests were carried 
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out as previously described [12]. All procedures were 
approved by the Salk Institute for Biological Studies 
Institutional Animal Care and Use Committee. 

Islet isolation and FACS sorting 

Islet isolation was carried out as previously described [12]. 
Islets from mInsl-H2b-mCherry x SlOOb-eGFP bitrans- 
genic animals were pooled by sex in two replicate groups 
of 10 or 11 animals. In preparation for sorting, islets were 
hand-picked into a 15 ml conical tube and allowed to sedi- 
ment before excess media was aspirated. Islets were disso- 
ciated at 37°C by adding 0.25% trypsin-EDTA (Invitrogen) 
aided by regular but gentle mechanical dissociation with a 
p200 pipette. Dissociated islet cells were washed in HBSS 
containing 10% FES and sorted at the Salk Institute Flow 
Cytometry core on a FACS Vantage SE DiVa (Becton- 
Dickinson, Franklin Lakes, NJ) using 488 and 568 excita- 
tion lines for eGFP and mCherry, respectively. Sorted cells 
were collected directly in Trizol. Flow cytometric staining 
for Illrl/CD121a-APC (Biolegend, San Diego, CA) was 
conducted on dissociated mCherry + beta cells at 5 micro- 
gram/ml in 100 microliter HBSS + 10% FBS. For glucose 
stimulation, islets from 3 month old male C57B16 mice 
(Harlan, Indianapolis, IN) were cultured overnight in 
RPMI containing 11 mM glucose, before glucose depri- 
vation for 2 hours in RPMI containing 2.8 mM glucose 
followed by 12 h incubation with 2.8 mM or 16.8 mM glu- 
cose. One hundred islets were used per replicate. 

RNA isolation and library prep 

RNA was isolated from Trizol by a chloroform extraction, 
assisted by phase lock tubes. RNA was precipitated by 
isopropanol and cleaned up over an RNEasy microcolumn 
(Qiagen, Valencia, CA) per the manufacturer's instruc- 
tions, taking great care to avoid carry over of ethanol 
following the column washes. Following elution in 30 
microliter elution buffer, RNA quality was verified by 
BioAnalyzer (Agilent, Santa Clara, CA). Indexed sequen- 
cing libraries were constructed using the TruSeq RNA 
sample Prep Kit v2 (lUumina Inc. San Diego, CA) and se- 
quenced at 50 cycles, single read on an lUumina HiSeq 
2000 platform. 

RNA-Seq and ChlP-Seq analysis 

Sequencing reads from two beta cell populations and two 
alpha cell populations were mapped using STAR [66] to 
the mouse genome version mm9 (NCBI build 37). Over 
30 million reads were sequenced for each library with over 
83% of sequenced reads aligning uniquely (>93% align- 
ment overall). Bedtools [67] was used to create count ta- 
bles of the sorted bam files using reads aligning to RefSeq 
defined exons. DESeq [68] was used for statistical com- 
parison. See Additional file 12 for a count table with all 
genes in our alpha and beta transcriptomes. Raw RNA-seq 



sequence files of human beta and non-beta cells [16] and 
ChlP-Seq data for mouse and human PDXl [38], mouse 
and human NKX6.1 [48,49], and mouse MAFA and 
NEURODl [69], were remapped using STAR (RNA-seq) 
or bowtie2 (ChlP-Seq) to their respective genomes (hgl9, 
mm9). Normalized genome browser tracks were prepared 
using HOMER (http://homer.salk.edu) [70] and uploaded 
into the University of California Santa Cruz genome 
browser to generate browser plots. ChlP-Seq peaks were 
determined using HOMER. Individual read alignments, 
transcripts, and ChlP-Seq peak positions for each human 
sequencing experiment were converted to the mouse 
genome using the UCSC liftOver tool. Homologene was 
used to map gene identifiers between mouse and human. 
To discover species-specific PDXl peaks, human PDXl 
ChlP-Seq read alignments were converted to the mm9 
genome. Peaks were found for both species with respect 
to the mm9 genome and differentially bound peaks were 
defined by peak regions containing 4-fold more reads in 
one species relative to the other. Instances of the PDXl 
motif were found using HOMER. Peak positions were 
converted between mm9 and hgl9 genomes using the lift- 
Over tool and scanned independently to identify species- 
specific motifs. The MEME Suite package MCAST [71] 
was used for the identification of statistically significant 
consensus binding sites for the four transcription factors 
found to be differentially- regulated by glucose {Etsl, Pax6, 
Pdxl, Myc). The full mouse transcriptome was scanned 
using default parameters. The consensus site motifs used 
were taken from the JASPAR Database [72]. 

Novel transcript discovery 

To identify novel IncRNA, RNA-Seq libraries from all repli- 
cates of alpha and beta cells were combined into a single 
meta-experiment to maximize sensitivity. Two approaches 
where used to identify transcripts. First, Cufflinks [73] was 
used with default settings to identify transcripts de novo. 
Second, the findPealcs program in homer was used with 
"-style histone" and "-minDist 1500" to identify regions of 
continuous high read density that Cufflinks missed. This 
approach yielded 71,730 predicted transcripts, which were 
used to confirm the absence or presence of transcripts 
predicted by other studies. Because many of the 71,730 
transcripts found by HOMER and Cufflinks were short 
fragments of RNA found within introns or immediately up- 
stream and downstream of known transcripts, we adopted 
stringent filtering criteria to identify novel, high-quality 
IncRNAs. Novel transcripts must (1) not overlap with any 
known RefSeq or UCSC gene exon, (2) have a total tran- 
script length greater than 3 kb, (3) expression in either 
alpha or beta cell experiment > 1 RPKM. (4) and not over- 
lap with known rRNA loci. We also excluded transcripts 
with a PhastCon score > 0.4 which likely reflect mapping 
artifacts to pseudogenes instead of actual novel transcripts. 
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Additional file 4: Is a side-by-side comparison of our alpha and 
beta cell transcriptomes with the beta cell transcriptomes derived 
from MIP-GFP + beta cells [20]. 

Additional file 5: Is a table listing the most significantly enriched 
functional categories in the beta and alpha cell fractions. 

Additional file 6: Is a figure highlighting the differential expression 
of genes important for glucose uptake and glycolysis, stimulus 
secretion coupling and insulin exocytosis in mouse beta and alpha 
cells. 

Additional file 7: Is a figure containing single channel images for 
the immuno-fluorescent panels of Figure 4. 

Additional file 8: Is a table listing the genes comprising the 
common beta cell transcriptome as defined by expression > 1 RPKM 
in beta cells of both species. 

Additional file 9: Is a table listing genes that are significantly 
enriched or uniquely expression in mouse or human beta cells. 

Additional file 10: Is a side-by-side comparison of a standard curve 
of mouse and human lAPP as measured by ELISA, demonstrating 
1 00% cross-reactivity. 

Additional file 11: Is a table listing novel transcripts discovered in 
purified mouse beta and alpha cells and the regulation of their 
expression by glucose. 

Additional file 12: Is a count table with all genes in our alpha and 
beta transcriptomes. 



Islet peptide content determination 

Islets were sonicated in 300 |il of KREBS ringer buffer, then 
centrifuged 10 min at 8000 K. The supernatant was trans- 
ferred to a fresh tube. Stock samples were stored at -20 C. 
For each assay the samples were diluted in the assay buffer 
supplied with each individual kit. Human and mouse 
insulin, glucagon and lAPP islet peptide content were mea- 
sured with commercially available RIA or ELISA kits 
(EMD MUlipore, Billerica, MA) per the manufacturer's 
instructions. As cross-reactivity for mouse lAPP had not 
been determined by EMD Millipore, a separate rat/mouse 
lAPP (American Peptide Company, Sunnyvale, CA) stan- 
dard curve was run in equivalent doses and shown to be 
superimposable with the human lAPP curve supplied with 
the ELISA, confirming 100% cross-reactivity with mouse 
L\PP (Additional file 10). Mouse and human islet lAPP 
samples were each quantified according to their homolo- 
gous standard curve. 

Immunofluorescence 

Immunofluorescence was carried out as previously des- 
cribed [17] using commercial antisera against glucagon 
(guinea pig anti-glucagon at 1:7000; EMD Millipore, Billerica 
MA), somatostatin (sheep anti-somatostatin at 1:1000; 
American Research Products Inc., Waltham MA), insulin 
(chicken anti-insulin at 1:1000; Abeam) and ILlrl/ 
CD121a (armenian hamster anti-mouse Illrl/CD121a- 
APC at 1:100; Biolegend, San Diego, CA). Rabbit antisera 
against UCN3 (#7218, 1:2000) and CRH (rc70, 1:5000) 
were generated in house. Secondary antibodies were raised 
in donkey against each of these host species and were ob- 
tained from Jackson Immuno Research Laboratories (West 
Grove, PA) conjugated to Alexa Fluor-405, -488, -649 or 
CyS and used at 1:600. Dapi was applied as a nuclear stain 
at 1 microgram/ml and slides were embedded with Pro- 
long Gold anttfade reagent (Life Technologies, Carlsbad, 
CA). All imaging was carried out on Zeiss LSM780 con- 
focal microscopes at The Waitt Advanced Biophotonics 
Center Core facility of the SaUc Institute. 

Human islets 

Human donor islets were obtained through the Integrated 
Islet Distribution Program (IIDP) and declared exempt 
from approval by the Institutional Review Board of the 
Salk Institute for Biological Studies. 

Additional files 



Additional file 1: Is a figure validating the expression of eGFP in 
alpha cells of the SIOOb-eGFP reporter mouse. 

Additional file 2: Is a table listing the 100 most significantly 
enriched genes in mouse beta cells. 

Additional file 3: Is a table listing the 100 most significantly 
enriched genes in mouse alpha cells. 
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